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Abstract. The technique of degree of randomness is used to model the correlations in 
sequences containing various subsignals and noise. Kolmogorov stochasticity parameter enables 
to quantify the randomness in number sequences and hence appears as an efficient tool to 
distinguish the signals. Numerical experiments for a broad class of composite signals of regular 
and random properties enable to obtain the qualitative and quantitative criteria for the behavior 
of the descriptor depending on the input parameters typical to astrophysical signals. 



1. The method 

The correlations functions and the power spectra are common and efficient tools for the study of 
correlations in the signals. Astrophysical signals typically are superposition of various subsignals, 
regular and random, by features comparable to each other and of weaker ones, i.e. perturbations 
or the noise. The procedure of analysing of the needed signal or signals, their separation from 
the noise is a common problem while dealing with observations and measurements. 

The Kolmogorov stochasticity parameter technique enables to quantify the randomness of 
sequences of number theory or dynamical systems [H EJ [31 HI [5] . 

The technique of the degree of randomness has been applied to the Cosmic Microwave 
Background (CMB) temperature sky maps and to the X-ray flux data of the clusters of galaxies. 
The former data were those obtained by the Wilkinson Microwave Anisotropy Probe (WMAP) 
during 7- year observations [6j[7], while the X-ray data were obtained by XMM-Newton satellite 
providing a particularly accurate and complete sky survey (see [SJHI]). In the case of CMB, the 
Kolmogorov function enabled to separate signals of different origin, e.g. the Galactic and non- 
Galactic ones, and to detect point sources in the CMB maps [TU] (see Fig.l). Concerning the 
X-ray clusters, it was shown that their X-ray images do possess correlation in the pixelized flux 
data peculiar to the gravitational potential of the galaxy clusters [IT]. This technique resembles 
the methods of the dynamical systems applied to nonlinear problems (e.g. |12j). 

A crucial step in these studies is the modeling and analysis of generated systems, which 
enables to reveal the behavior of the stochasticity parameter in the case of a given signal and 
then to consider the application of this technique for real signals [13J. Below we represent the 
results of numerical experiments for a broad class of signals. 

Kolmogorov stochasticity parameter is introduced for a sequence {X±, X2, ■ ■ ■ , X n } of real 
random variable X sorted in growing order X\ < X2 < . . . < X n . Then the theoretical 
distribution function is pjj [2] 



F(X) = n ■ (probability of the event x < X). 



(1) 



the stochasticity parameter is defined as 

X n = sup\F n (x) -F(x)\ (2) 

X 

where the empirical distribution function is 

F n (X) = (number of the elements Xi which areless than X), 

and 

r 0, X < xx 

F n {X) = \ k/n, x k < X < x k+1 (3) 
I 1, x n <X. 

Then for the limit 

lim P{X n < A} = $(A) , (4) 

n— too 

where 

+oo 

$(A) = (- 1 ) fc e _2fc2A2 , $(0) = 0, A > , (5) 

k=— oo 

exists at uniform convergence and independent on F. 




Figure 1. The Kolmogorov's function $ for the portion of the 7- year temperature CMB map 
obtained by WMAP. 

2. Random- regular sequences 

We consider a broad class of sequences, i.e. those composed of random x n and regular 
y n = — — ^™° d - (a, b are prime numbers) sub-sequences within (0, 1) 

z n = ax n + (1 - a)y n . (6) 

The parameter a varies within [0,1] defining random sequences at 1 and regular ones at 0, so 
that by mutually fixing the pair a, b we get new regular sequences. 



For z n we have 
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F(X) = 



a < X < 1 - a 



(7) 
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1-a < X < 1 
X > 1. 



Figure 2 shows the results of the numerical experiments for 100 sequences, each sequence 
containing 10000 elements. Each sequence is divided into 50 subsequences, i.e. m runs through 
values 1, 50, and for each of them the parameter 3>(A n ) m is calculated and then the empirical 
distribution function G($) m of these numbers is obtained. When the original sequences are 
random, this distribution have to be uniform according to Kolmogorov's theorem. To test 
that, x 2 fo r the functions G($) m and Go($) = <3? have been calculated, i.e. one parameter x 2 ls 
calculated for each of the 100 x 101 sequences. For 100 x 2 values per each value of a, we obtained 
the mean and error values for \ z '■> i- e - f° r each pair a, b we have a plot of the dependence of % 2 



Figure 2. The 3D x 2 f° r the Kolmogorov's function for the sequence z n vs a and the parameter 
a. 



3. Parameters of the regular sequences 

At certain values of the parameter a for different values of b the dependences in Fig. 2 are 
monotonic, while for others they do have maxima. To study this effect, we introduce a parameter 
A which is the difference of two values in those plots: maximal value of x 2 an d minimal value 
in the range a G (0,a max ), if a max is the position of the maximal value. Obviously, A is zero 
when the dependence is monotonic and no extrema do exist. Then we calculate A for fixed b 
and for each value of primary a = 2, b . 

The remarkable feature of the results is the strict mirror symmetry in FigJ3]in the dependence 
of A vs a, although no periodicity is found by Fourier analysis. 

The mirror symmetric plots can be hence subjects of particular study, e.g. in two versions: 
first, of the distribution of A and, second, the spacing between non-zero A and their distribution. 



on a. 





Figure 3. The dependence of A vs the parameter a. 



The null values of A are skipped and also - since due to the mirror symmetry each A has its 
pair - only one of each pair is taken into account. 

The results we give in FigH] where the number of non-zero As from Figgis given in growing 
order. The number of non-zero As appear to be proportional to b. 
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Figure 4. Sorted amplitudes of As from Figj3j 



4. Sum of fluctuations: large N limit 

The next problem we consider is the properties of the signal being a sum of random and regular 
fluctuations, each of sequences of 10000 elements and of the same standard deviation. 
The regular sequences have been chosen as 

1 N 

a t = —= V Compact(xi, -1, 1), (8) 



Random N=1000 
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Figure 5. x 2 frequency for random and regular sequences vs the Gaussian distribution. 
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Figure 6. Kolmogorov function <3? for the random and regular sequences in FigJSJ 



where 



is compactiiied arithmetical sequence within the interval (—1, 1), with step 1/k. 
The random sequences are given by 



1 N 



2_\. Random(—l, 1). 



At large number of sequences each new sequence y n is taken as the continuation of the former 
arithmetical progression. 
Here 

Compact(x,p, q) = q + xmod(p — q) 

indicate multiples of (p — q) from x having the value within the range (p,q),p < q- 

The results for random and regular sequences, 10000 each, are given in Figj5]for x 2 , when the 
number of the fluctuations vary from ./V = 1000 to 100000. The x 2 shows that for both, random 
and regular sequences, we deal with a Gaussian limiting distribution, in accordance with the 
Central Limit theorem which states that for large enough values of N, both sequences aj and bi 
tend to Gaussian sequences with the same a and /i independent on N. So, although differences 
are seen in the Gaussians, namely, the standard deviations are larger for the regular case, the 
X 2 are similar. 

For the Kolmogorov function $ the situation is rather different. When the Gaussians do 
appear both for random and regular sequences (as expected), the behaviors of is different 
and enables to separate them, as shown in FigJUJ Namely, it is close to a homogeneous function 
for random sequences and $ = 1 for regular ones. Kolmogorov's function therefore enables to 



distinguish the superposition of random and regular sequences, even though both are tending 
to Gaussians. 

Finally, we have probed the dependence of the results on the length of the sequences: the 
dependence on the number of the fluctuations within 1000 — 100000 is rather weak, x 2 varying 
around 10~ 8 — 10~ 9 . This confirms the universality of the obtained behavior of for both 
random and regular fluctuations. 



5. Results 

The performed analysis revealed the behavior of the Kolmogorov distribution vs the properties 
of the generated signals. To describe datasets which contain both regular and stochastic 
components, we considered sequences scaled by a single parameter a, indicating the ratio of 
those components. 

Quantitative and qualitative criteria have been obtained for the Kolmogorov distribution 
at numerical experiments for broad class of random and regular sequences depending on a 
parameter. 

a) The existence of the critical value for a has been shown, when the monotonic decay of the 
frequency count of the Kolmogorov distribution is transformed to a function with an extremum. 

b) The dependence of scalings and spacings vs that parameter shows mirror properties both 
in the amplitude and distribution of the frequency counts of the function <E>. 

c) The behavior of the randomness of a signal composed of ./V subsignals at large N limit has 
been studied, where the Kolmogorov function acts as an informative descriptor. Particularly, 
the descriptor at large N enables to distinguish the initial set of the fluctuations, even when 
the superposition both of random and regular subsignals is not informative since tends to a 
Gaussian in accordance to the Central Limit theorem. 

The studied properties are typical, for example, for astrophysical datasets, when the sought 
signals are superposed with regular and random fluctuations of various origin, and hence the 
behaviors revealed at the numerical experiments due to the universality of the technique will 
enable its informative application to real data. 
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